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Abstract 

Percolation on complex networks has been 
used to study computer viruses, epidemics, 
and other casual processes. Here, we present 
conditions for the existence of a network spe- 
cific, observation dependent, phase transi- 
tion in the updated posterior of node states 
resulting from actively monitoring the net- 
work. Since traditional percolation thresh- 
olds are derived using observation indepen- 
dent Markov chains, the threshold of the pos- 
terior should more accurately model the true 
phase transition of a network, as the up- 
dated posterior more accurately tracks the 
process. These conditions should provide in- 
sight into modeling the dynamic response of 
the updated posterior to active intervention 
and control policies while monitoring large 
complex networks. 

1 INTRODUCTION 

Increasingly often, researchers are confronted with 
monitoring the states of nodes in large computer, so- 
cial, or power networks where these states dynamically 
change due to viruses, rumors, or failures that prop- 
agate according to the graph topology [2j |5j |8] . This 
class of network dynamics has been extensively mod- 
eled as a percolation phenomenon, where nodes on a 
graph can randomly "infect" their neighbors. 

Percolation across networks has a rich history in 
the field of statistical physics, computer science, and 
mathematical epidemiology. Here, researchers are typ- 
ically confronted with a network, or a distribution over 
the network topology, and extract fixed point attrac- 
tors of node configurations, thresholds for phase tran- 
sitions in node states, or distributions of node state 
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configurations [IJ [TJ |9| . In the field of fault detection, 
the nodes or edges can "fail", and the goal is to ac- 
tivate a subset of sensors in the network which yield 
high quality measurements that identify these failures 
[T2] . While the former field of research concerns it- 
self with extracting offline statistics about properties 
of the percolation phenomenon on networks, devoid 
of any measurements, the latter field addresses online 
measurement selection tasks. 

Here, we propose a methodology that actively tracks a 
causal Markov process across a complex network (such 
as the one in Figure [3]), where measurements are adap- 
tively selected. We extract conditions such that the 
updated posterior probability of all nodes "infected" 
is driven to one in the limit of large observation time. 
In other words, we derive conditions for the existence 
of an epidemic threshold on the updated posterior dis- 
tribution over the states. 

The proposed percolation threshold should more ac- 
curately reflect the true conditions that cause a phase 
transition in a network, e.g., node status changing 
from healthy /normal to infected/failed, than tradi- 
tional thresholds derived from conditions on predic- 
tive distributions which are devoid of observations or 
controls. 

Since most practical networks of interest are large, 
such as electrical grids, it is usually infeasible to sam- 
ple all nodes continuously, as such measurements are 
either expensive or bandwidth is limited. Given these, 
or other resource constraints, we present an informa- 
tion theoretic sampling strategy that selectively tar- 
gets specific nodes that will yield the largest informa- 
tion gain, and thus, better detection performance. 

The proposed sampling strategy balances the trade- 
off between trusting the predictions from the known 
model dynamics (from percolation theory) and ex- 
pending precious resources to select a set of nodes for 
measurement. 

We present the adaptive measurement selection prob- 



lem and give two tractable approximations to this 
subset selection problem based upon the joint and 
marginal posterior distribution, respectively. A set 
of decomposable Bayesian filtering equations are pre- 
sented for this adaptive sampling framework and the 
necessary tractable inference algorithms for complex 
networks are discussed. We present analytical worst 
case performance bounds for our adaptive sampling 
performance, which can serve as sampling heuristics 
for the activation of sensors or trusting predictions 
generated from previous measurements. 

To the author's knowledge, this is the first attempt 
to extract a percolation threshold of an actively moni- 
tored network using the updated posterior distribution 
instead of the observation independent predictive dis- 
tributions. 

2 PROBLEM FORMULATION 

The objective of actively monitoring the n node net- 
work is to recursively update the posterior distribution 
of each hidden node state given various measurements. 
Specifically, the next set of m measurement actions 
(nodes to sample), m <C n, at next discrete time are 
chosen such that they yield the highest quality of in- 
formation about the n hidden states. The condition 
on m <C n simulates the reality of fixed resource con- 
straints, where typically only a small subset of nodes 
in a large network can be observed at any one time. 

Here, the hidden states are discrete random variables 
that correspond to the states encoded by the percola- 
tion process on the graph. Here, the graph Q = (V, £), 
with V representing the set of nodes and £ corre- 
sponding to the set of edges. Formally, we will as- 
sume a state-space representation of a discrete time, 
finite state, partially observed Markov decision process 
(POMDP). Here, 

z k = {zi...,zn (i) 

represents the joint hidden states, e.g., healthy or in- 
fected 

Y fc = {Y« ...,Y«} (2) 

represents the m observed measurements obtained at 
time fc, e.g., biological assays or PINGing an IP address, 
and 

a fc = K,...,an (3) 

represents the m actions taken at time /c, i.e., which 
nodes to sample. Here, Y k \ continuous/categorical 
valued vector of measurements, which is induced by 
action a 3 k , a J k G *4, with A = {l,...,n} confined 
to be the set of all n individuals in the graph, and 
Z\ G {0, 1, . . . , r}. Since the topology of Q encodes the 
direction of "flow" for the process, the state equations 



may be modeled as a decomposable partially observed 
Markov process: 

Yl = {(Zl)+wi (4) 
Z* = ( 5 ) 

Here, rj(i) = {j : £ (Vi,Vj) ^ 0} is the neighborhood 
of z, f(Z k ) is a non-random vector- valued function, w l k 

is measurement noise, and h [z k _ x , {^_i}je77(i)) is a 
stochastic equation encoding the transition dynamics 
of the Markov process (see Figure [I] for a two node 
graphical model representation). 




Figure 1: Partially Observed Markov Structure for i 
and j for £ (V;,v/) £ 

2.1 BAYESIAN FILTERING 

In our proposed framework for actively monitoring the 
hidden node states in the network, the posterior dis- 
tribution is the sufficient statistic for inferring these 
states. The general recursion for updating the joint 
posterior probability given all past and present obser- 
vations is given by the standard Bayes update formula: 

P (Z fe |Y fc ) = -M^-p^Yt 1 ) (6) 

with 

^ZfelYg" 1 ) = P(Z*|Zfc-i = zMZfc-i = zlYg" 1 ). 

zG{0,l,...,r}™ 

(7) 

The Chapman-Kolmogorov equations provide the con- 
nection between the posterior update Q and the 
distribution resulting from the standard percolation 
equations. In the former, the updates are conditional 
probabilities that are conditional on past observations, 
while in the latter, the updates are not dependent on 
observations. 

The local interactions in the graph Q imply the follow- 
ing conditional independence assumptions: 

n 

/(Y fc |Z fe ) = n/(m^)- (8) 

i=l 



n 

pCZfeiZfc-i) = ]Jp (zi\zi_ x , {z^W)) (9) 

i=l 

where the likelihood term is defined in Q and the 
transition dynamics are defined in This decom- 
posable structure allows the belief state (posterior ex- 
cluding time k observations) update, for the i th node 
in £?, to be written as: 

KZfclYg" 1 ) = Yl P(Z*\K-i = z)p(K-i = zlYg" 1 ) 

zG{0,l,...,r}IIP a ll 

(10) 

with the parent set, pa = {rj(i),i}. Unfortunately, 
for highly connected nodes in Q, this marginal update 
becomes intractable. It thus must be approximated 

Ga mam- 

2.2 INFORMATION THEORETIC 
ADAPTIVE SAMPLING 

In most real world situations, acquiring measurements 
from all n nodes at any time k is unrealistic, and 
thus, a sampling policy must be exploited for mea- 
suring a subset of nodes |6j H2]- Since we are con- 
cerned with monitoring the states of the nodes in the 
network, an appropriate reward is the expected in- 
formation gain between the updated posterior, p k = 
p(Zfc|{YJ.}i GafcJ Yq -1 ), and the belief state, p k \k-i = 
PtZfclY*- 1 ): 



a fc = argmax aC ^E V a ({Y*J iGa ) |Yg 



rk-1 



(ii) 



V a ({Yi} i6a ) = V a (p k \\Pk\k-i) , < a < 1 (12) 
with a-Divergence 

V a {p\\ q ) = -^- l \og^ q [{p/q) a ]) (13) 

for distributions p and q with identical support. 



The reward in (11) has been widely applied to 



multi-target, multi-sensor tracking for many prob- 
lems including, sensor management and surveillance 
lEj. Note that lim a _>i£> a (p||g) -> V KL (p\\q), 
where T>kl(p\\q) is the Kullback-Leibler divergence 
between p and q. The expectation in ( [TT] ) is taken with 
respect to the conditional distribution ^(Y/c|Yq _1 ) 



given the previous measurements Yq _1 and actions 
SLk- In practice, the expected information divergence 
in ( pTj ) must be evaluated via Monte-Carlo methods. 



Also, the maximization in (11) requires enumeration 
over all (^) actions (for subsets of size m), and there- 
fore, we must resort to Greedy approximations. We 
propose incrementally constructing the set of actions 
at time k : a^, for j = 1, . . . , m, according to: 



argmax ie ^ afc E [v a (v l fc , {Y{} jeak ) (Yg" 1 



(14) 



Both (11) and (14) are selecting the nodes to sam- 



ple which yield maximal divergence between the per- 
colation prediction distribution (belief state) and the 
updated posterior distribution, averaged over all possi- 



ble observations. Thus ( 11 ) provides a metric to assess 



whether to trust the predictor and defer actions until 
a future time or choose to take action, sample a node, 
and update the posterior. 



2.2.1 



Lower Bound on Expected 
a-Divergence 



Since the expected a-Divergence in (11) is not closed 
form, we could resort to numerical methods for esti- 
mating this quantity. Alternatively, one could specify 
an analytical lower-bound that could be used in-lieu of 
numerically computing the expected information gain 



in (11) or (14). 



We begin by noting that the expected divergence be- 
tween the updated posterior and the predictive dis- 
tribution (conditioned on previous observations) dif- 
fer only through the measurement update factor, 
fk/9k\k-i (( p) re- written): 

0]] 

fk 



E, 



k | k 



E 



9k\k- 



1 



_! \Pol (Pk\\Pk\k~l 
lQ g E Pk\k- 



9k\k-l 



(15) 



So, if there is significant overlap between the likeli- 
hood distributions of the observations, the expected 
divergence will tend to zero, implying that there is 
not much value-added in taking measurements, and 
thus, it is sufficient to use the percolation predictions 
for inferring the states. 

It would be convenient to interchange the order of the 



conditional expectations in (15). It is easily seen that 



Jensen's inequality yields the following lower bound 
for the expected information gain 



E 9k\k-i \P« [Vk\\Vk\k-x)\ 



> 



a — 1 



E 



9k\k-l 



fk 



9k\k-l 



(16) 



Here, the inner conditional expectation can be ob- 
tained from V a (/fc||<7fc|fc_i), which has a closed form 
for common distributions (e.g., multivariate Gaus- 
sians) [6]. 

3 ASYMPTOTIC ANALYSIS OF 
MARGINAL POSTERIOR 

For tracking the percolation process across (5, we have 
discussed recursive updating of the belief state. How- 
ever, computing these updates exactly is in general in- 
tractable. For the remainder of the paper, we will use 



Q and ([5| to directly update the marginal posterior 
distribution using the following matrix representation: 



Pfc(s) 



(17) 



with updated marginal posterior Pk( z ) = 
[p 1}k (z),...,p nyk (z)] T with p iyk (z) = p(Z % k = 

zlYlYt 1 ), D fc (z) = diag (^fcV^fclfc-i), 
and marginal belief state P/c|/c-i( z ) = 

[Pl,fe|/c-l(^),---,Pn,fe|/c-l(^)] T With p i>fe |fe-l(^) = 

K^^IYg- 1 )- 



Note that for z ^ a&, (D/ C (z)) ii = 1, and Pi^( z ) = 
Given that we can find an efficient way 
of updating Pk\k-i( z )> according to the transition dy- 
namics ([5]), we can solve a modified version of (14), for 
j = l,...,ra: 



argmax i(EtA \ a , 



E 



V a (Y*) lYg- 1 



(18) 



D a (Yi) = P a {piA*)\\PiMk-i( z )) , < a < 1. 



(19) 



3.1 TOTAL DIVERGENCE OF UPDATED 
POSTERIOR 

One interesting property of the Bayesian filtering equa- 
tions is that the updated posterior can be written as 
a perturbation of the predictive percolation distribu- 
tion through the following relationship (z omitted for 
clarity) : 

Pk = D fcPfe|/c-l = Pfc|fc-1 + ( D /c - I) Pk\k-1- ( 20 ) 

Hence, when the sensors do a poor job in discriminat- 
ing the observations, D& « I, we have p k « Pk\k-i- ^ 
is of interest to determine when there is significant dif- 
ference between the posterior update and the prior up- 
date specified by the standard percolation equations. 
Recall that the updated posterior is, in the mean, 
equal to the predictive distribution, E pJYq -1 = 

Pfc|fc— i- T ne total deviation of the updated posterior 
from the percolation distribution can be summarized 
by computing the trace of the following conditional 
covariance: 



tr 



tr E 



(p*-e[: 



P fc |Y, 



(R[p fc |Y fc - 1 ]) = 

"])(»-'[: 



(21) 



Using (20) and properties of the trace operator, we 



obtain the following measure of total deviation of the 
updated posterior from the predictive distribution in 
terms of f k and g k \k-i- 



tr R 



Pk\ 



rk-l 



= tr (E [(D fc - I) 2 lYg" 1 " 



k\k-l 



(22) 



with P k \k-i = Pfe|fe-iPfe|fe-i- The conditional expec- 
tation in (22) is the Pearson x 2 divergence between 
distributions fi ik and g^k-ij f° r a ^ This joint mea- 
sure of deviation is analytical for particular families of 
distributions and thus can be used as an alternative 
measure of divergence for activation of sensors [6 . 

3.2 PERCOLATION THRESHOLD OF 
UPDATED POSTERIOR 

There has recently been significant interest in deriv- 
ing the conditions of a percolation/epidemic threshold 
in terms of transition parameters and the graph ad- 
jacency matrix spectra for two state causal Markov 
processes [TJ [4j |9]. Such thresholds yield conditions 
necessary for an epidemic to arise from a small number 
of "infections" . Knowledge of these conditions are par- 
ticularly useful for designing "robust" networks, where 
the probability of epidemics is minimized. 

Percolation thresholds are typically obtained by ex- 
tracting the sufficient conditions of the network and 
model parameters for the node states to be driven 
to their stationary point, with high probability. The 
probability of these events are usually computed using 
the observation independent percolation distribution 

Bung. 

We use the results in [TJ [4] to derive a percolation 
threshold based upon the updated posterior distribu- 
tion assuming a restricted class of two-state Markov 
processes. These conditions should more accurately 
model the current network threshold since the poste- 
rior distribution tracks a particular "disease" trajec- 
tory better than the observation independent percola- 
tion distribution. 



Formally, Z\ e {0,1}, f$ = f(Y\\Z\ 
the conditional likelihood for node z, pi ik 
llY^Yg" 1 ), and p i>fc - - mk-u 

will assume that Z k 
of the system. 



z) is 



p(zi = 

p(Zt = 1IY*- 1 )- Here, we 
is the unique absorbing state 



The Bayes update for p iyk can be written as (i sub- 
script omitted for clarity): 



Pk 



f (i) 
Jk 



fk ] Pk\k-l + /fc W; (! — P/c|fc-l) 

f ( 
Ik 



Pk\k-1 



fl 1] /fl 0) 



f (i)_ f (o) 
1 ' AO) Pk\k-1 



Pk\k-1 



f 

1 + ^mPk\k-i 
j k 



Pk\k-1- 



(23) 



There are three different sampling/observation depen- 
dent possibilities for each individual at time k: case 



(1) , i is not sampled and therefore, p k = Pk\k-ii case 

(2) , Af k > 0, and case (3), Af k < 0. 

We first derive a tight-upper bound for cases (2) and 

(3) of the form p k < c k Pk\k-i- For the remainder of 
the analysis we will assume that \^fp k \ k -i\ < 1 f° r 

f k 

cases (2) and (3) (see Appendix). 

Using the upper-bounds derived in the Appendix, and 
after gathering all n nodes, we have the following 
element-wise upper-bound on the updated belief state: 



k\k-l 



(B k + O k ) Pk 



\k-l- 



(24) 



with 



diag(& i)fe ) and O k 



diag \ l{Af itk <0}O [^rrPi,k\k-i J J where I {A / 4jfc <o} 
is the indicator function for the event Afi k < 0. 



Thus far, we have established, under the assumptions 
of \^fPk\k-i\ < 1? an upper-bound for the updated 

f k 

posterior in terms of observation likelihoods and the 



belief state ( 24 ) 



Next, consider the restricted class of two-state Markov 
processes on for which we can produce a bound of 
the form 

Pfc|fc-i < S Pk-i ( 2 5) 

where S contains information about the transition pa- 
rameters and the topology of the network. 

It turns out that the SIS model of mathematical epi- 
demiology falls within this restricted class of percola- 
tion problems pQ. 

The SIS model on a graph assumes that each of the 
n individuals are in states or 1, where corresponds 
to susceptible and 1 corresponds to infected. At any 
time /c, an individual can receive the infection from 
their neighbors, r/(i), based upon their states at k — 1. 

Under this SIS model in [1 



S = (1- 7 )I + /?A 



(26) 



where the Markov transition parameters 7 is the prob- 
ability of i transitioning from 1 to 0, (3 is the proba- 
bility of transmission between neighbors i and j, and 
A is the graph adjacency matrix (see Figure [2]). 



Returning to the derivation, using the bound (25), we 
have, by induction, the following recursion: 

Pk < C fcPfc|fc _ 1 <C fc Sp fe _ 1 <(C,S---C 1 S)p 
= (B fe S---B 1 S)p + (9 Cfc s (27) 

where we have lumped the higher order modes and 
higher order cross-terms into Oc k s- 



The dominant mode of decay of the updated poste- 
rior may be found by investigating the following eigen- 
decomposition: 



Bz,S 



\i =1 




(28) 



with ej = [0,...,0,1,0, ...,0] T (1 at j th element). 
Without loss of generality, we can assume the eigen- 
values of S are listed in decreasing order, |Ai| > • • • > 
|A n |. Now rewriting (28), we have 



B fe S = (b jk e jk ej k + O b ) (A lUl uf + O s ) 



= (Ai b jk e jk ej k ui uf + O bs 



(29) 



where bj k 



max ie{lr .. )n} 6 i)fc and the B ,O s ,0 B s 
variables corresponds to the higher order terms. In- 



serting (29) into (27), and matching the largest eigen- 



values of Bfe with Ai we obtain 
Pfe < (B fe S...B 1 S)p + O c ,s 

k / k \ 

= *i Il&i, II (^l^i) Po +0(^X30) 



1 = 1 



\l=l 



Thus, at large fc, the dominant mode of the posterior 
goes as A^ nf=i ^ji (^ ne m °des in 0(cp k ) decay faster 
than the dominant mode presented above). 

We can see that if the spectral radius of S is less than 
one, I Ai| < 1, then for large fc, p k — > 0, which is the 
unique absorbing state of the system. 

This epidemic threshold condition on Ai has been pre- 
viously established for unforced 5'/5'-percolation pro- 
cesses pQ. However, in the tracking framework, the 
rate at which the posterior decays to the susceptible 
state is perturbed by an additional measurement de- 
pendent factor, nf=i fyz • 

This measurement-dependent dominant mode of the 
posterior should more accurately model the true dy- 
namic response of the node states better than that 
in pQ since the posterior better tracks the truth than 
the unforced predictive distribution. Additionally, this 
dominant mode of the updated posterior distribution 
allows one to simulate the response of the percolation 
threshold to intervention and control actions which are 
designed to increase the threshold, such that the prob- 
ability of epidemics is minimized. 

4 NUMERICAL EXAMPLE 

Here, we present results of simulations of our adaptive 
sampling for the active tracking of a causal Markov 
ground truth process across a random 200 node, scale- 
free network (Figure [3|. Since the goal in tracking 




Figure 2: SIS Markov Chain for Node i Interacting 
with the Infected States of its Neighbors 



is to accurately classify the states of each node, we 
are interested in exploring the detection performance 
as the likelihood of an epidemic increases through the 
percolation threshold for this graph. 

One would expect different phase transitions (thresh- 
olds) in detection performance for various sampling 
strategies, ranging from the lowest threshold for un- 
forced percolation distributions to highest for a con- 
tinuous monitoring of all n nodes. We will present a 
few of these detection surfaces that depict these phase 
transitions for the unforced percolation distribution, 
random m — 40 node sampling, and our proposed in- 
formation theoretic adaptive sampling of m — 40. 

Here, we will restrict our simulations to the two-state 
SIS model of mathematical epidemiology described 
above. 




Figure 3: 200 Node Scale-Free Graph Q = (V,£) 

The sensor models Q, are of the form of two- 
dimensional multivariate Guassians with common co- 
variance and shifted mean vector. The transition dy- 
namics of the i th individual ([5]), for the SIS model is 
given by: 



4iz£f )} ~ (i-7)4-i+(i-4-i) 



i-n 

j£r](i) 

(31) 



where Z z k _ 1 G {0, 1} is the indicator function of i be- 
ing infected at time k — 1. The transmission term 
between i and rj(i) is known the Reed- Frost model 



[U ID |8]. Since the tail of the degree distribution of 
our synthetic scale-free graph contains nodes with de- 



gree greater than 10, updating (10) exactly is unreal- 



istic and we must resort to approximate algorithms. 
Here, we will assume the mean field approximation 
used by pQ for this SIS model, resulting in the fol- 
lowing marginal belief state update for the i th node of 
infected (Z{ = 1): 



Pi,k\k-i = (l-7)Pt,jfe-i+(l-Pt,jfe-i) 



(32) 



Equation (32) allows us to efficiently update the 



marginal belief state directly for all n nodes which are 
then used for estimating the best m measurements us- 



ing (18). 



As we are interested in detection performance as a 
function of time and epidemic intensity, the Area Un- 
der the ROC Curve (AUR) is a natural statistic to 
quantify the detection power (detection of the infected 
state). The AUR is evaluated at each time fc, each SIS 
percolation intensity parameter 

t = Ph (33) 

and over 500 random initial states of the network. For 
the SIS model, r is the single parameter (aside from 
the topology of the graph) that characterizes the in- 
tensity of the percolation/epidemic. It is useful to 
understand how the detection performance varies as 
a function of epidemic intensity, as it indicates how 
well the updated posteriors are playing "catch-up" in 
tracking the true dynamics on the network. 

For this SIS model, the percolation threshold is de- 
fined as r c = l/Ai(A) where Ai(A) = max iG { lv ^ n } |A^| 
is the spectral radius of the graph adjacency matrix, 
A pQ . Values of r greater than r c imply that any infec- 
tion tend to become an epidemic, whereas those values 
less than r c imply that small epidemics tend to die out. 

For the network under investigation (Figure |3|, r c = 
0.1819. We see from Figure 4(a) that a phase transi- 
tion in detection power (AUR) for the unforced per- 
colation distribution does indeed coincide with the 
epidemic threshold r c . While the epidemic threshold 
for the random and adaptive sampling policies is still 
r c = 0.1819, the measurements acquired allow the pos- 
terior to better track the truth, but only up to their re- 
spective phase transitions in detection power (see Fig- 
ures [4(b)] and [4(c)]) . 

Figure |4(c)| confirms that the adaptive sampling bet- 
ter tracks the truth than randomly sampling nodes, 
while pushing the phase transition in detection per- 
formance to higher percolation intensities, r. We see 
that the major benefit of the adaptive sampling is ap- 
parent when conditions of the network are changing 




pling method. 



(a) AUR Surface for Unforced Prediction Distribution (no ev- 
idence acquired throughout the monitoring) 





(b) AUR Surface for Updated Posterior Distribution with m 
40 Random Measurements at Each Time k 



(c) AUR 
m = 40 
Each Time k 



Figure 5: Relative Frequency of Nodes Sampled (z- 
axis) of a Given Degree (a;- axis) Over Time (y-axis) for 
m = 40 Adaptive Sampling Strategy: a.) r = 0.125, 
b.) r = 0.2143, and c.) r = 0.5 

It is often useful for developing sampling heuristics and 
offline control/intervention policies to inspect what 
type of nodes, topologically speaking, is the adaptive 
sampling strategy targeting, under various network 
conditions (different values of r). In Figure [5| the 
relative frequency of nodes sampled with a particular 
degree is plotted against time (under the m = 40 adap- 
tive sampling strategy) for three different values of r 
(over 500 random initial conditions of the network). 

For the larger of the three values explored (r = 0.5 > 
r c ) we see that the sampling is approximately uniform 
across the nodes of each degree on the graph (Figure 
[5jc)). Therefore, under extremely intense epidemic 
conditions, the adaptive sampling strategy is target- 
ing all nodes of each degree equally, and therefore, it 
Surface for Updated Posterior Distribution with is sufficient to perform random sampling. For the two 
Information Theoretic Adaptive Measurements at i ower values of r, Figure ^a) and Figure ^b) (near 

r c ), we see that adaptive policy targets highly con- 
nected nodes more frequently than those of lesser de- 
gree and thus, it is more advantageous to exploit such 
a strategy, as compared to random sampling (see AUR 
surface in Figure 4(c)). 




Figure 4: Area under the ROC curve surface as a func- 
tion of percolation parameter r = and time 



moderately, at medium epidemic conditions. Beyond 
a certain level of percolation intensity, more resources 
will need to be allocated to sampling to maintain a 
high level of detection performance. 

A heuristic sampling strategy based on the topology 
of Q was also explored (results not shown) by sam- 
pling the "hubs" (highly-connected nodes). However, 
detection performance was only slightly better than 
random sampling and poorer than our adaptive sam- 



5 DISCUSSION 

In this paper, we have derived the conditions for a net- 
work specific percolation threshold using expressions 
for the updated posterior distribution resulting from 
actively tracking the process. These conditions recover 
the unforced percolation threshold derived in pQ but 
with an additional factor involving sensor likelihood 
terms due to measurements obtained throughout the 



monitoring. A term of the form flf=i bj l (derived 



in (|30j)) was shown to be the dominant mode of the 
updated posterior dynamic response to active inter- 
vention of immunizing the nodes (holding node states 
constant). The conditions of the percolation using the 
updated posterior should more accurately model the 
phase transition corresponding to a particular disease 
trajectory and therefore, enable a better assessment 
of immunization strategies and any subsequent obser- 
vations resulting from such actions. The framework 
presented above, along with the new posterior perco- 
lation threshold, should provide additional insight into 
active monitoring of large complex networks under re- 
source constraints. 

6 APPENDIX 



In case (2), when Afk > 0, we can re- write (23) in 
terms of an alternating geometric series: 
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(34) 



where we have used the fact that 1/(1 + \a\) < 1 + \a\. 
Recalling that p > p 2 for < p < 1, we have 
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(35) 



In case (3), when Afk < 0, and (23) can be represented 
as a geometric series: 
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Once again, using the p > p 2 bound, we obtain: 
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(37) 

A general inequality, that is equality for case (1), is of 
the form p k < c k p k \k-i with 



bk 



I A/ fc | 
f (o) 
J k 



, Af k > or Af k < 



with Ck = bk for cases (1) and (2) and Ck — bk 
O (^j^rPk\k-i^ for case (3). 
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